The Challenges and Pitfalls of Arabic Romanization and Arabization
نویسنده
چکیده
The high level of ambiguity of the Arabic script poses special challenges to developers of NLP tools in areas such as morphological analysis, named entity extraction and machine translation. These difficulties are exacerbated by the lack of comprehensive lexical resources, such as proper noun databases, and the multiplicity of ambiguous transcription schemes. This paper focuses on some of the linguistic issues encountered in two subdisciplines that play an increasingly important role in Arabic information processing: the romanization of Arabic names and the arabization of nonArabic names. The basic premise is that linguistic knowledge in the form of linguistic rules is essential for achieving high accuracy.
منابع مشابه
Internationalization of a Distance Exam Web Environment
This paper describes an architecture to provide multilingual support for an exam Web environment with an emphasis on Arabic localization (or Arabization). Developing software products for populations with different cultures is a two-step process: internationalization followed by localization. Proper Arabization is particularly complex with many interesting challenges. In this context, we develo...
متن کاملA Novel Method to Evaluate Romanization Systems: The Case of Romanizing Arabic Proper Nouns
The transliteration of Arabic proper nouns to other languages is usually based on the phonetic translation of these nouns into their phonetic Latin counterparts. Most of the dictionaries do not include most of these nouns, although some may have meanings. Transliteration is essential generally to Natural Language Processing (NLP) field and specifically to machine translation systems, cross-lang...
متن کاملInvestigating the challenges of teaching and learning Arabic in the high schools of Zabol County1
Purpose: This paper aims to evaluate the teaching and learning processes of the Arabic course in the high schools of Zabol County. Methodology: Descriptive-correlational method was applied as the research method and the statistical population was comprised of two groups – students and teachers of Arabic course. It had a practical aim and relies on the general hypothesis stating that the Arabic ...
متن کاملIMPACTS AND CHALLENGES OF CLOUD COMPUTING FOR SMALL AND MEDIUM SCALE BUSINESSES IN NIGERIA
Cloud computing technology is providing businesses, be it micro, small, medium, and large scale enterprises with the same level playing grounds. Small and Medium enterprises (SMEs) that have adopted the cloud are taking their businesses to greater heights with the competitive edge that cloud computing offers. The limitations faced by (SMEs) in procuring and maintaining IT infrastructures has be...
متن کاملLanguage barriers in medical education and attitudes towards Arabization of medicine: student and staff perspectives.
Students and staff perspectives on language barriers in medical education in Egypt and their attitude towards Arabization of the medical curriculum were explored in a questionnaire survey of 400 medical students and 150 staff members. Many students (56.3%) did not consider learning medicine in English an obstacle, and 44.5% of staff considered it an obstacle only in the 1st year of medical scho...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007